Skip to content

Retry evals on API error.#23322

Merged
alisa-alisa merged 1 commit intomainfrom
gundermanc/500s
Mar 21, 2026
Merged

Retry evals on API error.#23322
alisa-alisa merged 1 commit intomainfrom
gundermanc/500s

Conversation

@gundermanc
Copy link
Copy Markdown
Member

@gundermanc gundermanc commented Mar 21, 2026

Summary

Retry behavioral evals tests on API errors.

@gundermanc gundermanc marked this pull request as ready for review March 21, 2026 01:00
@gundermanc gundermanc requested review from a team as code owners March 21, 2026 01:00
@gemini-code-assist
Copy link
Copy Markdown
Contributor

Summary of Changes

Hello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request enhances the robustness of behavioral evaluation tests by introducing an automatic retry mechanism. This change aims to mitigate test failures caused by intermittent API errors, ensuring more reliable test outcomes, particularly within continuous integration pipelines.

Highlights

  • Test Retries: Implemented retry logic for behavioral evaluation tests to prevent failures due to transient API errors, especially in CI environments.
Ignored Files
  • Ignored by pattern: .github/workflows/** (1)
    • .github/workflows/evals-nightly.yml
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request adds a retry mechanism to the behavioral evaluation tests, which is a great step towards improving CI stability.

@github-actions
Copy link
Copy Markdown

Size Change: -4 B (0%)

Total Size: 26.1 MB

Filename Size Change
./bundle/chunk-BPKLOIVU.js 0 B -3.64 MB (removed) 🏆
./bundle/chunk-ZGHXR4UA.js 0 B -14.5 MB (removed) 🏆
./bundle/core-N43CUAUG.js 0 B -42.2 kB (removed) 🏆
./bundle/devtoolsService-RDENZ3NU.js 0 B -27.7 kB (removed) 🏆
./bundle/interactiveCli-O5WSQU7O.js 0 B -1.61 MB (removed) 🏆
./bundle/oauth2-provider-O7LADRNN.js 0 B -9.16 kB (removed) 🏆
./bundle/chunk-KF5RCNHV.js 14.5 MB +14.5 MB (new file) 🆕
./bundle/chunk-LS3CM2L4.js 3.64 MB +3.64 MB (new file) 🆕
./bundle/core-4KQP7PMC.js 42.2 kB +42.2 kB (new file) 🆕
./bundle/devtoolsService-IZOGH3S5.js 27.7 kB +27.7 kB (new file) 🆕
./bundle/interactiveCli-EGTGU3UL.js 1.61 MB +1.61 MB (new file) 🆕
./bundle/oauth2-provider-5KDPSEAB.js 9.16 kB +9.16 kB (new file) 🆕
ℹ️ View Unchanged
Filename Size
./bundle/chunk-34MYV7JD.js 2.45 kB
./bundle/chunk-5725SFQR.js 1.95 MB
./bundle/chunk-5AUYMPVF.js 858 B
./bundle/chunk-664ZODQF.js 124 kB
./bundle/chunk-DAHVX5MI.js 206 kB
./bundle/chunk-IUUIT4SU.js 56.5 kB
./bundle/chunk-RJTRUG2J.js 39.8 kB
./bundle/devtools-36NN55EP.js 696 kB
./bundle/dist-T73EYRDX.js 356 B
./bundle/gemini.js 519 kB
./bundle/getMachineId-bsd-TXG52NKR.js 1.55 kB
./bundle/getMachineId-darwin-7OE4DDZ6.js 1.55 kB
./bundle/getMachineId-linux-SHIFKOOX.js 1.34 kB
./bundle/getMachineId-unsupported-5U5DOEYY.js 1.06 kB
./bundle/getMachineId-win-6KLLGOI4.js 1.72 kB
./bundle/memoryDiscovery-OV4FUTHJ.js 922 B
./bundle/multipart-parser-KPBZEGQU.js 11.7 kB
./bundle/node_modules/@google/gemini-cli-devtools/dist/client/main.js 221 kB
./bundle/node_modules/@google/gemini-cli-devtools/dist/src/_client-assets.js 227 kB
./bundle/node_modules/@google/gemini-cli-devtools/dist/src/index.js 11.5 kB
./bundle/node_modules/@google/gemini-cli-devtools/dist/src/types.js 132 B
./bundle/sandbox-macos-permissive-open.sb 890 B
./bundle/sandbox-macos-permissive-proxied.sb 1.31 kB
./bundle/sandbox-macos-restrictive-open.sb 3.36 kB
./bundle/sandbox-macos-restrictive-proxied.sb 3.56 kB
./bundle/sandbox-macos-strict-open.sb 4.82 kB
./bundle/sandbox-macos-strict-proxied.sb 5.02 kB
./bundle/src-QVCVGIUX.js 47 kB
./bundle/tree-sitter-7U6MW5PS.js 274 kB
./bundle/tree-sitter-bash-34ZGLXVX.js 1.84 MB

compressed-size-action

@SandyTao520 SandyTao520 enabled auto-merge March 21, 2026 01:09
@SandyTao520 SandyTao520 added this pull request to the merge queue Mar 21, 2026
@gemini-cli gemini-cli bot added the status/need-issue Pull requests that need to have an associated issue. label Mar 21, 2026
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 21, 2026
@gundermanc gundermanc added this pull request to the merge queue Mar 21, 2026
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 21, 2026
@alisa-alisa alisa-alisa added this pull request to the merge queue Mar 21, 2026
Merged via the queue into main with commit 28935d1 Mar 21, 2026
27 checks passed
@alisa-alisa alisa-alisa deleted the gundermanc/500s branch March 21, 2026 03:04
ProthamD pushed a commit to ProthamD/gemini-cli that referenced this pull request Mar 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

status/need-issue Pull requests that need to have an associated issue.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants